Polysemous Codes

Authors

  • Matthijs Douze
  • Hervé Jégou
  • Florent Perronnin
Abstract

This paper considers the problem of approximate nearest neighbor search in the compressed domain. We introduce polysemous codes, which offer both the distance estimation quality of product quantization and the efficient comparison of binary codes with Hamming distance. Their design is inspired by algorithms introduced in the 1990s to construct channel-optimized vector quantizers. At search time, this dual interpretation accelerates the search: most of the indexed vectors are filtered out with Hamming distance, leaving only a fraction of the vectors to be ranked with an asymmetric distance estimator. The method is complementary to a coarse partitioning of the feature space such as the inverted multi-index. This is shown by our experiments on several public benchmarks, including the BIGANN dataset comprising one billion vectors, for which we report state-of-the-art results for query times below 0.3 milliseconds per core. Finally, our approach allows the approximate computation of the k-NN graph associated with the Yahoo Flickr Creative Commons 100M dataset, described by CNN image descriptors, in less than 8 hours on a single machine.
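The two-stage search described in the abstract (a Hamming-distance filter followed by asymmetric distance ranking of the survivors) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the toy sizes, the random codebooks, and the function names (`train_toy_codebooks`, `encode`, `hamming`, `adc_tables`, `search`) are hypothetical and not taken from the paper. In particular, the sketch does not reproduce the index-assignment optimization that makes the Hamming distance between centroid indices meaningful, so its filter is far less reliable than real polysemous codes would be.

```python
import random

# Toy sizes (assumed for illustration): 4 sub-quantizers,
# 16 centroids each (4 bits per index), 2 dimensions per sub-vector.
M, KSUB, DSUB = 4, 16, 2

def train_toy_codebooks(seed=0):
    """Random centroids stand in for trained product-quantizer codebooks."""
    rng = random.Random(seed)
    return [[[rng.uniform(-1, 1) for _ in range(DSUB)]
             for _ in range(KSUB)] for _ in range(M)]

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def encode(vec, codebooks):
    """Product-quantize: nearest centroid index for each sub-vector."""
    code = []
    for m in range(M):
        sub = vec[m * DSUB:(m + 1) * DSUB]
        code.append(min(range(KSUB),
                        key=lambda k: sq_dist(sub, codebooks[m][k])))
    return code

def hamming(code_a, code_b):
    """Binary interpretation: count differing bits of the centroid indices."""
    return sum(bin(a ^ b).count("1") for a, b in zip(code_a, code_b))

def adc_tables(query, codebooks):
    """Per-sub-quantizer lookup tables of query-to-centroid distances."""
    return [[sq_dist(query[m * DSUB:(m + 1) * DSUB], c)
             for c in codebooks[m]] for m in range(M)]

def search(query, codes, codebooks, threshold, k=1):
    """Filter by Hamming distance, then rank survivors by asymmetric distance."""
    qcode = encode(query, codebooks)
    tables = adc_tables(query, codebooks)
    survivors = [i for i, c in enumerate(codes)
                 if hamming(qcode, c) <= threshold]
    scored = [(sum(tables[m][codes[i][m]] for m in range(M)), i)
              for i in survivors]
    return sorted(scored)[:k], len(survivors)
```

Calling `search` with a small `threshold` discards most database codes after a cheap bit comparison, and only the survivors incur the table lookups of the asymmetric estimator. Raising `threshold` to the total number of bits (`M * 4` here) disables the filter and falls back to plain product-quantization ranking.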


Similar resources

EFL Translation Students' Perspective toward Using Bilingual Dictionary in Translation of Polysemous Words

This research presented the use of bilingual dictionary and addressed the EFL translation students' points of view on the use of bilingual dictionary in translating polysemous words (English to Persian). Moreover, it aimed at finding the possible relationship between the effect of using bilingual dictionary by students in translating polysemous words and their achieved scores. In the study ...


Semantic Structures of Polysemous Psych-adjectives in Korean: A Conceptual Semantics Approach

Although research has been conducted on the polysemous nature of some Korean psych-adjectives, no consensus has been reached on the criteria used for evaluating the polysemy. Furthermore, few formalizations (semantic structures) have been proposed for the polysemous phenomena. The purpose of this paper is twofold: 1) to propose new criteria for distinguishing polysemous psych-adjectives from mo...


Reducing Lexical Semantic Complexity With Systematic Polysemous Classes And Underspecification

This paper presents an algorithm for finding systematic polysemous classes in WordNet and similar semantic databases, based on a definition in (Apresjan 1973). The introduction of systematic polysemous classes can reduce the amount of lexical semantic processing, because the number of disambiguation decisions can be restricted more clearly to those cases that involve real ambiguity (homonymy). ...


Journal of Memory and Language

... represented distinctly in the lexicon or if there is a common, core meaning. In all experiments, a polysemous word was used twice, in phrases that selected the same or different senses. Experiment 1 showed that sense consistency aided memory for the polysemous word. Experiment 2 extended this result to a timed sensicality judgment task. Experiment 3 demonstrated tha...


Understanding words in context: the role of Broca's area in word comprehension.

What role does meaning selection play in word comprehension, and what neural systems support this selection process? Most words have multiple meanings and are therefore ambiguous. This is true of both homonymous words (words that have multiple unrelated meanings) and polysemous words (words that have multiple related meanings). The extant evidence indicates that meaning selection is an integral...




Publication date: 2016